PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information
TF ID GSMUA_Achr1P04360_001
Organism Musa acuminata
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Zingiberales; Musaceae; Musa
Family HD-ZIP
Protein Properties Length: 302 aa    MW: 33174.2 Da    pI: 9.8926
Description HD-ZIP family protein
Gene Model
Gene Model ID            Type      Source    Coding Sequence
GSMUA_Achr1P04360_001    genome    CIRAD     View CDS
Signature Domain
No.  Domain       Score  E-value  Start  End  HMM Start  HMM End
1    Homeobox     58.3   1.3e-18  139    193  2          56
                            T--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
               Homeobox   2 rkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                            rk+ +++k+q  +Lee F+++++++ +++  LAk+l+L  rqV vWFqNrRa+ k
  GSMUA_Achr1P04360_001 139 RKKLRLSKDQSAILEESFKEHNTLNPKQKLALAKQLNLRPRQVEVWFQNRRARTK 193
                            788899***********************************************98 PP

2    HD-ZIP_I/II  128    4.1e-41  139    228  1          91
            HD-ZIP_I/II   1 ekkrrlskeqvklLEesFeeeekLeperKvelareLglqprqvavWFqnrRARtktkqlEkdyeaLkraydalkeenerLekeveeLr 88 
                            +kk+rlsk+q+++LEesF+e+++L+p++K +la++L+l+prqv+vWFqnrRARtk+kq+E+d+e+Lkr++++l++en+rL+kev+eLr
  GSMUA_Achr1P04360_001 139 RKKLRLSKDQSAILEESFKEHNTLNPKQKLALAKQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLQKEVQELR 226
                            69*************************************************************************************9 PP

            HD-ZIP_I/II  89 eel 91 
                             +l
  GSMUA_Achr1P04360_001 227 -AL 228
                            .55 PP
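The Homeobox alignment (domain 1) pairs an HMM consensus row with the query row, and the middle line repeats a letter wherever the two agree. As a minimal sketch of how to read it, the snippet below counts those identical columns directly from the two rows copied from this record. The function name `count_identities` is hypothetical and is not part of HMMER or PlantTFDB.

```python
# Consensus (HMM) and query rows copied from the Homeobox alignment in this record.
HMM_ROW = "rkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek"
QUERY_ROW = "RKKLRLSKDQSAILEESFKEHNTLNPKQKLALAKQLNLRPRQVEVWFQNRRARTK"


def count_identities(hmm_row: str, query_row: str) -> int:
    """Count aligned columns where the consensus residue equals the query residue.

    Comparison is case-insensitive; gap characters ('.' or '-') never count.
    """
    if len(hmm_row) != len(query_row):
        raise ValueError("alignment rows must have equal length")
    return sum(
        1
        for h, q in zip(hmm_row.upper(), query_row.upper())
        if h == q and h not in ".-"
    )


identical = count_identities(HMM_ROW, QUERY_ROW)
aligned = len(QUERY_ROW)
print(f"{identical}/{aligned} identical columns")
```

The number of aligned columns also equals End - Start + 1 for the query (193 - 139 + 1), which is a quick consistency check on the table row.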

Protein Features
Database         Entry ID          E-value   Start  End  InterPro ID  Description
Pfam             PF04618           4.1E-29   1      112  IPR006712    HD-ZIP protein, N-terminal
Gene3D           G3DSA:1.10.10.60  5.5E-18   130    193  IPR009057    Homeodomain-like
SuperFamily      SSF46689          1.15E-18  130    196  IPR009057    Homeodomain-like
PROSITE profile  PS50071           17.265    135    195  IPR001356    Homeobox domain
SMART            SM00389           1.6E-16   137    199  IPR001356    Homeobox domain
Pfam             PF00046           4.8E-16   139    193  IPR001356    Homeobox domain
CDD              cd00086           1.90E-16  139    196  No hit       No description
PROSITE pattern  PS00027           0         170    193  IPR017970    Homeobox, conserved site
Pfam             PF02183           3.3E-11   195    229  IPR003106    Leucine zipper, homeobox-associated
SMART            SM00340           7.6E-25   195    238  IPR003106    Leucine zipper, homeobox-associated
Gene Ontology
GO Term       GO Category           GO Description
GO:0006355    Biological Process    regulation of transcription, DNA-templated
GO:0005634    Cellular Component    nucleus
GO:0003700    Molecular Function    transcription factor activity, sequence-specific DNA binding
GO:0043565    Molecular Function    sequence-specific DNA binding
Sequence
Protein Sequence    Length: 302 aa
MMGKDDLGLS LSLSSSSHHH HLPPQLHLMP PSSSPAASVP LPSPPFPCHQ RTQPVVDMSG  60
GAAEARSLPR LRGIDVNRAP AGATERDSEE DAGTSSPNST LSSVSGKRGE RDHHLGDELD  120
PDRACSRGIS DEEDGDGSRK KLRLSKDQSA ILEESFKEHN TLNPKQKLAL AKQLNLRPRQ  180
VEVWFQNRRA RTKLKQTEVD CEFLKRCCET LTDENRRLQK EVQELRALKL SPQSVQSTKS  240
CLPSPVLHAS VSRLNGPRSA GSFVTPSDLI GGQKASGRPH HYCARSAHHK TGSRRRRSPP  300
SE
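The sequence is printed in groups of ten residues with a running residue count at the end of each line, so it needs cleaning before use. A minimal sketch, assuming only that every non-letter character (spaces, digits, newlines) is layout rather than sequence; the function name `parse_sequence_block` is hypothetical.

```python
import re

# Sequence block exactly as printed in this record, including the running counts.
RAW_BLOCK = """
MMGKDDLGLS LSLSSSSHHH HLPPQLHLMP PSSSPAASVP LPSPPFPCHQ RTQPVVDMSG  60
GAAEARSLPR LRGIDVNRAP AGATERDSEE DAGTSSPNST LSSVSGKRGE RDHHLGDELD  120
PDRACSRGIS DEEDGDGSRK KLRLSKDQSA ILEESFKEHN TLNPKQKLAL AKQLNLRPRQ  180
VEVWFQNRRA RTKLKQTEVD CEFLKRCCET LTDENRRLQK EVQELRALKL SPQSVQSTKS  240
CLPSPVLHAS VSRLNGPRSA GSFVTPSDLI GGQKASGRPH HYCARSAHHK TGSRRRRSPP  300
SE
"""


def parse_sequence_block(block: str) -> str:
    """Return the residues as one uppercase string, dropping spaces and counters."""
    return re.sub(r"[^A-Za-z]", "", block).upper()


protein = parse_sequence_block(RAW_BLOCK)
print(len(protein))  # should agree with the stated length of 302 aa
```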
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    137    143  SRKKLRL
2    187    195  RRARTKLKQ
3    284    296  RSAHHKTGSRRRR
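When checked against the protein sequence in this record, the NLS coordinates appear to be 0-based with an inclusive end, whereas the Signature Domain coordinates match the sequence as 1-based inclusive positions. The sketch below illustrates the difference; the helper names are hypothetical, and the convention is inferred from this record only, not from PlantTFDB documentation.

```python
# Protein sequence from this record, joined into one string.
PROTEIN = (
    "MMGKDDLGLSLSLSSSSHHHHLPPQLHLMPPSSSPAASVPLPSPPFPCHQRTQPVVDMSG"
    "GAAEARSLPRLRGIDVNRAPAGATERDSEEDAGTSSPNSTLSSVSGKRGERDHHLGDELD"
    "PDRACSRGISDEEDGDGSRKKLRLSKDQSAILEESFKEHNTLNPKQKLALAKQLNLRPRQ"
    "VEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLQKEVQELRALKLSPQSVQSTKS"
    "CLPSPVLHASVSRLNGPRSAGSFVTPSDLIGGQKASGRPHHYCARSAHHKTGSRRRRSPP"
    "SE"
)

# (start, end, listed sequence) rows copied from the NLS table.
NLS_ROWS = [
    (137, 143, "SRKKLRL"),
    (187, 195, "RRARTKLKQ"),
    (284, 296, "RSAHHKTGSRRRR"),
]


def slice_zero_based(seq: str, start: int, end: int) -> str:
    """Slice with a 0-based start and an inclusive end (as the NLS rows seem to use)."""
    return seq[start:end + 1]


def slice_one_based(seq: str, start: int, end: int) -> str:
    """Slice with a 1-based start and an inclusive end (as the domain rows seem to use)."""
    return seq[start - 1:end]


for start, end, listed in NLS_ROWS:
    found = slice_zero_based(PROTEIN, start, end)
    print(start, end, found, found == listed)

# Homeobox domain row (139-193), read as 1-based inclusive.
homeobox = slice_one_based(PROTEIN, 139, 193)
print(homeobox)
```

Reading the NLS rows as 1-based instead shifts each hit by one residue (for example, 137-143 would give GSRKKLR), so it is worth confirming the convention before merging these coordinates with the domain table.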
Regulation -- PlantRegMap
Source         Upstream Regulator    Target Gene
PlantRegMap    Retrieve              -
Annotation -- Protein
Source       Hit ID                   E-value    Description
Refseq       XP_009388885.1           1e-145     PREDICTED: homeobox-leucine zipper protein HAT4-like
Swissprot    P46601                   8e-82      HAT2_ARATH; Homeobox-leucine zipper protein HAT2
TrEMBL       M0RWZ4                   0.0        M0RWZ4_MUSAM; Uncharacterized protein
STRING       GSMUA_Achr1P04360_001    0.0        (Musa acuminata)
Orthologous Group
Lineage     Orthologous Group ID    Taxa Number    Gene Number
Monocots    OGMP25873889
Best hit in Arabidopsis thaliana
Hit ID         E-value    Description
AT4G16780.1    5e-60      homeobox protein 2